Corpus: pol_news_2008_300K

Other corpora

4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of word-N-grams for N=1...5 for the first K sentences

K # of words # of bigrams # of trigrams # of 4-grams # of 5-grams
100 99 99 99 99 99
1000 912 979 997 999 999
10000 7175 9230 9847 9963 9987
100000 41865 79097 94116 98438 99419
1000000 87154 208442 269003 290116 295957


Zipf's diagram for sentence endings


Gnuplot diagram

11611 msec needed at 2018-03-19 20:37